A Convex Formulation for Learning from Crowds
نویسندگان
چکیده
Recently crowdsourcing services are often used to collect a large amount of labeled data for machine learning, since they provide us an easy way get labels at very low cost and in short period. The use has introduced new challenge that is, coping with the variable quality crowd-generated data. Although there have been many recent attempts address problem multiple workers, only few existing methods consider learning classifiers directly from such noisy All these modeled true as latent variables, which resulted non-convex optimization problems. In this paper, we propose convex formulation crowds without estimating by introducing personal models individual crowd workers. We also devise efficient iterative method solving problems exploiting conditional independence structures classifiers. evaluate proposed against three competing on synthetic sets real crowdsourced set demonstrate outperforms other methods.
منابع مشابه
A Convex Formulation for Learning from Crowd
Recently crowdsourcing services are often used to collect a large amount of labeled data for machine learning, since they provide us an easy way to get labels at very low cost and in a short period. The use of crowdsourcing has introduced a new challenge in machine learning, that is, coping with the variable quality of crowd-generated data. Although there have been many recent attempts to addre...
متن کاملConvex Formulation for Learning from Positive and Unlabeled Data
We discuss binary classification from only positive and unlabeled data (PU classification), which is conceivable in various real-world machine learning problems. Since unlabeled data consists of both positive and negative data, simply separating positive and unlabeled data yields a biased solution. Recently, it was shown that the bias can be canceled by using a particular non-convex loss such a...
متن کاملClustered Multi-Task Learning: A Convex Formulation
In multi-task learning several related tasks are considered simultaneously, with the hope that by an appropriate sharing of information across tasks, each task may benefit from the others. In the context of learning linear functions for supervised classification or regression, this can be achieved by including a priori information about the weight vectors associated with the tasks, and how they...
متن کاملActive Learning from Crowds
Obtaining labels can be expensive or timeconsuming, but unlabeled data is often abundant and easier to obtain. Most learning tasks can be made more efficient, in terms of labeling cost, by intelligently choosing specific unlabeled instances to be labeled by an oracle. The general problem of optimally choosing these instances is known as active learning. As it is usually set in the context of su...
متن کاملLearning From Crowds
For many supervised learning tasks it may be infeasible (or very expensive) to obtain objective and reliable labels. Instead, we can collect subjective (possibly noisy) labels from multiple experts or annotators. In practice, there is a substantial amount of disagreement among the annotators, and hence it is of great practical interest to address conventional supervised learning problems in thi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v26i1.8105